Ancestral Reconstruction for Association Mapping
نویسندگان
چکیده
One of the greatest promises of modern genetics is the ability to identify loci responsible for, or at least highly correlated with, important physical traits and disease susceptibility. This will enable benefits ranging from improved disease risk assessment and crop design to ultimately a better understanding of how genotype variation effects variations in phenotypes. A key component in this task is association mapping where correlations between the clustering in the trait of interest and the clustering observed in the genealogy of a set of sampled sequences has the potential to localise causative genomic locations with relatively high precission. Association mapping is commonly based on single nucleotide polymorphism (SNP) data, where individuals have been typed on a number of marker loci where two different nucleotides are present with reasonably high frequency in a population. However, more complex data including multi-allelic loci and insertiondeletion variation, in particular microsatellite repeat count variation can also be utilised. The human HapMap project International HapMap Consortium [2005] identified and continues to refine a dense set of SNP marker locations, and similar data is becoming available for other organisms. The availability of high density chips harnessing this information Barrett and Cardon [2006] allows genome-wide scans to be performed in large sets of individuals, and with the constant decrease in cost and increase in fidelity of sequencing and chip technologies it is to be expected that high density association mapping on large data sets becomes standard for the study of within-population phenotypic variation. The two fundamental features of SNP data, and similar types of population variation data, that makes association mapping feasible is a relatively low mutation rate and the presence of recombinations in the evolutionary history of the data across the region covered. The low mutation rate causes both variation in the trait of interest and the type of a marker locus to be clustered into one or at most a few clades in the local genealogy of a sample. Recombination causes the genealogy to
منابع مشابه
Association mapping through heuristic evolutionary history reconstruction-application to GAW15 Problem 3
This paper presents a novel method of identifying phenotypically important regions of the genome. It involves a form of association mapping that works by summarizing properties of the ancestral recombination graph (ARG) of a sample of unrelated phenotyped and genotyped individuals. By breaking the sample into many small sub-samples and averaging the results, it becomes computationally tractable...
متن کاملA Methodological Framework for the Reconstruction of Contiguous Regions of Ancestral Genomes and Its Application to Mammalian Genomes
The reconstruction of ancestral genome architectures and gene orders from homologies between extant species is a long-standing problem, considered by both cytogeneticists and bioinformaticians. A comparison of the two approaches was recently investigated and discussed in a series of papers, sometimes with diverging points of view regarding the performance of these two approaches. We describe a ...
متن کاملReconstructing the genomic architecture of ancestral mammals: lessons from human, mouse, and rat genomes.
Recent analysis of genome rearrangements in human and mouse genomes revealed evidence for more rearrangements than thought previously and shed light on previously unknown features of mammalian evolution, like breakpoint reuse and numerous microrearrangements. However, two-way analysis cannot reveal the genomic architecture of ancestral mammals or assign rearrangement events to different lineage...
متن کاملComputational reconstruction of mitochondria-encoded mammal ancestral proteins
A method based on mapping a symbolic sequence into a set of patterns (strings resulting from the sequence parsing) is proposed as a tool for the reconstruction of ancestral sequences. The set union of patterns comprises all the patterns present in the family of related proteins sequences of an extant species. The set of most frequent patterns among protein sequences is selected and concatenated...
متن کاملLinkage-disequilibrium mapping of disease genes by reconstruction of ancestral haplotypes in founder populations.
Linkage disequilibrium (LD) mapping may be a powerful means for genome screening to identify susceptibility loci for common diseases. A new statistical approach for detection of LD around a disease gene is presented here. This method compares the distribution of haplotypes in affected individuals versus that expected for individuals descended from a common ancestor who carried a mutation of the...
متن کامل